Sporozoite surface protein 2 (SSP2) is a 140-kDa, protective sporozoite surface protein from Plasmodium yoelii distinct
from the circumsporozoite protein (CSP). A genomic clone containing the SSP2 gene was isolated and sequenced to determine
its size, structural organization and deduced primary amino acid sequence. The coding sequence consists of a single, long open
reading frame encoding 826 amino acids. The overall structure of SSP2 is similar to that of the CSP, consisting of a central
region of immunogenic amino acid repeats flanked by non-repetitive sequence. SSP2 has one copy of a thrombospondin
repeat motif in common with several cell adhesion molecules as well as with the CSP and the thrombospondin related
anonymous protein (TRAP) of P. falciparum. Additionally, SSP2 shares substantial sequence similarity to TRAP, suggesting
that TRAP is the analogue of SSP2 in P. falciparum.
Introduction

Efforts to develop a pre-erythrocytic stage malaria vaccine have focused almost entirely
on the circumsporozoite protein (CSP) [1]. The CSP is the predominant protein on the surface
of the infective malaria sporozoite. It is well known that immunization of humans or
animals with radiation attenuated sporozoites induces solid sterile immunity to malaria and
both humoral and cellular immune responses to the CSP [2-6]. Monoclonal antibodies
(mAbs) [7,8,10] and cytotoxic T cells [9] directed against the CSP are protective in
passive transfer. Nonetheless, it has not been possible to induce active immunity with
recombinant or synthetic vaccines based on the CSP alone [8,11-17] comparable to that
achieved by immunization with irradiated sporozoites. We have therefore attempted to
identify additional sporozoite surface antigens which might be combined with the CSP in a
multicomponent vaccine. We recently described a new sporozoite surface antigen,
sporozoite surface protein 2 (SSP2), from Plasmodium yoelii [5,18]. Monoclonal antibodies
directed against SSP2 recognize a 140-kDa protein in sporozoite extracts. Sequence
analysis of a 1.5-kb genomic DNA fragment encoding part of SSP2 revealed an immunogenic
series of repeating amino acids and a region of similarity to the region II domain of
the CSP [1]. Mice immunized with P815 mouse mastocytoma transfectants expressing the
partial SSP2 sequence and the CSP were protected against challenge with infective
sporozoites [19]. We report here the characterization of a genomic clone containing the
complete SSP2 gene.


Materials  and  Methods 


Parasites and DNA isolation. P. yoelii 17X (NL) parasites were maintained and DNA
isolation performed as described previously [20].

Genomic library construction and screening. A P. yoelii genomic library was constructed
using 2.0-7.0-kb fragments generated by partial DNase I digestion as previously described
[18]. Purified insert DNA from λgMSY4 was nick translated and used to screen the library
under standard high stringency conditions [21]. Five positive clones were isolated, and
one, λPySSP2.10, was found to contain a 4.7-kb insert which overlapped both ends of λgMSY4.

DNA sequencing. Phage DNA of λPySSP2.10 was prepared from liquid lysates by standard
methods. Insert DNA was released by EcoRI digestion and the inserts were cloned into
M13mp18 and pUC18. Overlapping clones spanning the insert were generated in pUC18
and M13 using exonuclease III digestion [22]. Single and double stranded templates were
sequenced using the dideoxy method [23] and Sequenase (U.S. Biochemical Corp.,
Cleveland, OH). Sequence analysis was carried out using Genepro 4.2 and DNASIS software.


Results 

A λgt11 DNase I genomic library was screened with the 1.5-kb fragment of the
SSP2 gene contained in λgMSY4. Five positive clones were obtained. One, λPySSP2.10,
contained a 4.7-kb insert which included the complete λgMSY4 sequence. The 4.7-kb
insert was subcloned into pUC18 and M13mp18. Nested deletions were prepared using
exonuclease III [22] and the complete sequence determined by the Sanger dideoxy
method [23].

Fig. 1 shows the sequence of the 4.7-kb insert of λPySSP2.10. A single long open
reading frame is present and includes the previously described sequence of λgMSY4
[18]. The AT contents of the coding and noncoding regions, 63.2% and 80.7%
respectively, are similar to those found in other Plasmodium genes [24]. The sequence
encodes a polypeptide containing 826 amino acids with a calculated molecular weight of
91 300. Several possibilities may account for the discrepancy between this calculated
molecular weight and the observed molecular weight, 140 000 [5,18]. First, the gene
might contain additional exons. However, no additional long open reading frames were
found either 700 bp 5' to the initiation codon or 1500 bp 3' to the first in-frame stop
codon. No Plasmodium consensus intron boundary sequences [24] were found in the
flanking regions. A large intron could extend beyond the region we have sequenced,
but previously described Plasmodium introns have been less than 600 bp long. Second,
SSP2 may be a glycoprotein, and indeed, there are several consensus N-glycosylation
sites in the sequence (Fig. 1). Finally, the protein may migrate anomalously in
SDS-PAGE gels, perhaps as a result of its very high proline content.
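
The AT content and the consensus N-glycosylation sites discussed above can both be checked directly from sequence data. The following is a minimal sketch only: the function names are hypothetical, the toy inputs are not the SSP2 sequence, and the glycosylation consensus is assumed to be the classical N-X-S/T sequon with X any residue except proline.

```python
import re


def at_content(dna: str) -> float:
    """Return the percentage of A and T bases in a DNA string."""
    dna = dna.upper()
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(dna.count(base) for base in "AT") / len(dna)


def n_glycosylation_sites(protein: str) -> list[tuple[int, str]]:
    """Return (1-based position, tripeptide) for each N-X-S/T sequon, X != P.

    A lookahead is used so that overlapping sequons are all reported.
    """
    pattern = re.compile(r"(?=(N[^P][ST]))")
    return [(m.start() + 1, m.group(1)) for m in pattern.finditer(protein.upper())]


if __name__ == "__main__":
    # Toy inputs for illustration; they are not taken from Fig. 1.
    print(at_content("AATTGC"))
    print(n_glycosylation_sites("MNGSANPTNKT"))
```

A sequon found this way is only a potential site; whether SSP2 is actually glycosylated is not established by the sequence alone.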

The deduced amino acid sequence of SSP2 is shown in Fig. 1 and a map of the sequence in
Fig. 2. Like a number of other Plasmodium surface antigens [24], the deduced amino acid
sequence contains tandem repeats of simple amino acid sequences and is particularly rich
in proline (18.0%) and asparagine (21.2%). The general structure of SSP2 is similar to
that of the CSP (Fig. 2). There is a central region of short, repeated peptide sequences
flanked on both sides by non-repetitive sequence. Hydrophobicity analysis [25] identified
a putative N-terminal hydrophobic leader [26] as well as putative transmembrane and
cytoplasmic domains [27] near the carboxy terminus (Figs. 1
and 2). It is interesting, however, that while SSP2 has both a transmembrane domain and a
cytoplasmic domain, the CSP has only a membrane anchor, suggesting that these 2
proteins interact differently with the membrane.

Fig. 1. Sequence of the 4.7-kb insert of λPySSP2.10. The inferred amino acid sequence of SSP2 is shown below the open
reading frame. The location of the conserved thrombospondin motif, of the repeated peptide motifs, and of the putative
hydrophobic leader and transmembrane domain is indicated. Potential N-glycosylation sites are underlined.
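
Hydrophobicity analysis of the kind cited above is commonly carried out as a sliding-window average over a residue hydropathy scale. The sketch below assumes the Kyte-Doolittle scale with a 19-residue window and a cutoff of 1.6, which are conventional choices for membrane-spanning segments; the text does not state which scale or window was actually used, and the function names are hypothetical.

```python
# Kyte-Doolittle hydropathy values (assumed scale; not stated in the text).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(protein: str, window: int = 19) -> list[float]:
    """Mean hydropathy of every full window along the sequence."""
    values = [KD[aa] for aa in protein.upper()]
    if window < 1 or window > len(values):
        return []
    return [
        sum(values[i:i + window]) / window
        for i in range(len(values) - window + 1)
    ]


def hydrophobic_windows(protein: str, window: int = 19, cutoff: float = 1.6) -> list[int]:
    """1-based start positions of windows whose mean hydropathy exceeds the cutoff."""
    profile = hydropathy_profile(protein, window)
    return [i + 1 for i, mean in enumerate(profile) if mean > cutoff]
```

Applied to a full-length protein, runs of consecutive high-scoring windows near the N-terminus and near the C-terminus would correspond to a candidate leader and a candidate transmembrane segment, respectively.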


The arrangement of short amino acid repeats in SSP2 is complex. There is first a region
of 23 perfect repeats of the tripeptide PNN, followed by 3 degenerate copies, PND PSN PNN.
There is then a short segment of non-repetitive sequence, followed by 3 tandem copies of
the pentamer PNKP(K/N), 4 copies of PNKPN alternating with 4 copies of PNEPSN, and 18
tandem degenerate copies of PNEPSN. These tandem repeats are followed by a short region
in which short repeat motifs, PEE and PSN, are interspersed with non-repetitive sequence.
Finally, there is a tandem duplication of the 11-mer, PEESNPKEPIN. All of the repeat
sequences in SSP2 are clearly distinct from the repeats in the P. yoelii CSP, QGPGAP and
QQPP. The only common feature which the SSP2 repeats share with the CSP repeats is the
general structure PXXPXX, which might be expected to impart to the repeat domains of
both proteins a structure rich in β-bends [28].

Fig. 2. Structure of SSP2 and CSP. Shown are schematic diagrams of the sequence of SSP2 (A) and the P. yoelii CSP (B) [33].
The diagrams are drawn approximately to scale.

Fig. 3. Alignment of the N-terminal (A) and C-terminal (B) regions of P. yoelii SSP2 and P. falciparum TRAP. Identical
amino acids and conservative substitutions are marked. The thrombospondin repeat motif is underlined.

SSP2  shares  sequence  motifs  with  several 
plasmodial  proteins  and  molecules  involved  in 
cell  adhesion.  Thrombospondin,  the  CSP 
region  II,  properdin,  the  terminal  complement 
components  and  the  thrombospondin  related 
anonymous  protein  share  similarities  based 
around the nonapeptide, WSPCSVTCG
[29,30].  This  sequence  is  found  in  3  copies  in 
thrombospondin,  6  copies  in  properdin  and 
one  copy  in  all  CS  proteins  sequenced  to  date. 
A  similar  sequence,  underlined  in  Fig.  1,  is  also 
found  in  SSP2.  In  SSP2  this  thrombospondin 
motif  is  found  amino  terminal  to  the  central 
repeat  region,  while  the  analogous  sequence  in 
the  CSP  is  found  in  Region  II,  carboxy 
terminal  to  the  repeats. 
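
A motif that is "similar" rather than identical to WSPCSVTCG can be located by sliding the nonapeptide along a protein and counting mismatches at each offset. The sketch below is one simple way to do this, assuming an ungapped comparison; the function name and the toy sequence are hypothetical, and the toy sequence is not the SSP2 sequence.

```python
TSP_MOTIF = "WSPCSVTCG"  # nonapeptide quoted in the text


def best_motif_match(protein: str, motif: str = TSP_MOTIF) -> tuple[int, str, int]:
    """Return (1-based start, window, mismatches) for the closest ungapped match.

    If several windows tie, the first one is returned.
    """
    protein = protein.upper()
    if len(protein) < len(motif):
        raise ValueError("sequence shorter than motif")
    best = None
    for start in range(len(protein) - len(motif) + 1):
        window = protein[start:start + len(motif)]
        mismatches = sum(a != b for a, b in zip(window, motif))
        if best is None or mismatches < best[2]:
            best = (start + 1, window, mismatches)
    return best


if __name__ == "__main__":
    # Hypothetical toy sequence for illustration only.
    print(best_motif_match("AAWSECSTTCGKK"))
```

The reported position can then be compared with the start of the repeat region to see whether the motif lies amino terminal or carboxy terminal to the repeats.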

The  N-terminal  and  C-terminal  regions  of 
SSP2  bear  a  remarkable  similarity  to  TRAP 
which  extends  well  beyond  the  similarity  to  the 
thrombospondin  motif.  Fig.  3A  shows  an 
alignment  of  the  N-terminal  regions  of  SSP2 
and  TRAP.  Over  a  region  of  281  amino  acids, 
there  is  43%  similarity  at  the  amino  acid  level. 
Ten of 11 cysteine residues are conserved, the
only exception being a single cysteine in the
putative hydrophobic leader of SSP2. Fig. 3B
shows the alignment of the C-terminal regions
of  the  2  proteins.  Over  a  region  of  71  amino 
acids,  there  is  56%  identity  at  the  amino  acid 
level.  SSP2  and  TRAP  may  be  members  of  a 
protein  family  involved  in  interaction  between 
the  sporozoite  and  erythrocytic  stages  of 
Plasmodium  and  the  cells  of  the  host. 
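
Figures such as percent identity and the number of conserved cysteines can be derived from a pairwise alignment by comparing it column by column. The sketch below assumes two pre-aligned strings of equal length with "-" as the gap character and counts identity over columns in which neither sequence has a gap; the text does not state which denominator was used, and the function names and toy inputs are hypothetical.

```python
def percent_identity(a: str, b: str, gap: str = "-") -> float:
    """Percent of ungapped alignment columns in which both residues are identical."""
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    pairs = [(x, y) for x, y in zip(a.upper(), b.upper()) if x != gap and y != gap]
    if not pairs:
        raise ValueError("no aligned residues")
    return 100.0 * sum(x == y for x, y in pairs) / len(pairs)


def conserved_cysteines(a: str, b: str) -> tuple[int, int]:
    """Return (cysteines in a that align with a cysteine in b, total cysteines in a)."""
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    a, b = a.upper(), b.upper()
    conserved = sum(x == y == "C" for x, y in zip(a, b))
    return conserved, a.count("C")


if __name__ == "__main__":
    # Toy alignments for illustration; they are not taken from Fig. 3.
    print(percent_identity("ACDE-G", "ACDQWG"))
    print(conserved_cysteines("C-ACC", "CGACA"))
```

A similarity score, as opposed to identity, would additionally need a definition of conservative substitutions, which is not given here.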


Discussion 

SSP2 is a new, non-CSP, 140-kDa sporozoite surface antigen from P. yoelii [18]. We
have recently observed that mice immunized with a combination of P815 mouse
mastocytoma transfectants expressing the CSP and the original 1.5-kb fragment of
SSP2 [18] are protected against challenge with infective P. yoelii sporozoites [19].
SSP2 and its presumed homologs in the human Plasmodium species are therefore
important vaccine candidates. We have here reported the complete sequence of the
P. yoelii SSP2 gene.

The deduced amino acid sequence of SSP2 shares a number of characteristics with other
Plasmodium surface antigens in general and with the CS protein in particular. First, it is
characterized by a central repeat region consisting of tandem repeats of several different
short peptide sequences. As in the case of the CSP repeats from many Plasmodium
species, the amino acids used in the SSP2 repeats are chosen from a restricted set of
amino acids (P,N,E,Q,G,D,A,R,V for CSP repeats and P,N,E,K,S,I for the SSP2 repeats).
As in the P. yoelii CSP, there are several different repeat units in the repeat
region; however, the organization of the repeats in SSP2 is somewhat more complex
than that in the CSP. There are 2 major repeat regions, one consisting of tandem repeats
of the tripeptide PNN, the second consisting of 2 basic repeat units, PNKPN and PNEPSN.
The units are intercalated in the general structure AAAABABABABBBB, where A = PNKPN
and B = PNEPSN. This organization could have arisen from an ancestral 11-mer repeat
unit AB = PNKPNPNEPSN by duplication of the component 5- and 6-mers at the amino
and carboxy terminal ends of an ancestral 11-mer repeat region. One would expect that
over the course of evolutionary time the central, alternating AB repeats could be
eliminated by homologous recombination, resulting in a simpler AAAAABBBBBB
organization, which is in fact observed in a number of CSP repeat regions [31-33]. The
SSP2 repeat region may, therefore, represent an intermediate step in a general mechanism
in Plasmodium antigen genes by which an ancestral tandem duplication of a relatively
long sequence evolves into a repeat region characterized by tandem repeats
of 2 or more different short peptide sequences.
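
The A/B notation used above can be made concrete by expanding a pattern string into its peptide sequence and reading a peptide back into unit symbols. The sketch below is an idealized illustration: it uses only the two perfect units named in the text and ignores the degenerate copies present in the actual SSP2 repeat region, and the function names are hypothetical.

```python
UNITS = {"A": "PNKPN", "B": "PNEPSN"}  # repeat units named in the text
PATTERN = "AAAABABABABBBB"             # general structure quoted in the text


def build_repeats(pattern: str, units: dict[str, str] = UNITS) -> str:
    """Expand a string of unit symbols into the corresponding peptide."""
    return "".join(units[symbol] for symbol in pattern)


def tokenize_repeats(peptide: str, units: dict[str, str] = UNITS) -> str:
    """Read a peptide left to right as a string of unit symbols.

    Raises ValueError if some position matches none of the units.
    """
    symbols = []
    pos = 0
    while pos < len(peptide):
        for symbol, unit in units.items():
            if peptide.startswith(unit, pos):
                symbols.append(symbol)
                pos += len(unit)
                break
        else:
            raise ValueError(f"no repeat unit matches at position {pos + 1}")
    return "".join(symbols)


if __name__ == "__main__":
    region = build_repeats(PATTERN)
    print(region)
    print(tokenize_repeats(region))
```

Under this notation the proposed ancestral 11-mer is simply `build_repeats("AB")`, and a tandem array of it tokenizes as alternating A and B symbols.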

Sporozoites  which  have  been  inoculated  into 
the  mammalian  host  progress  rapidly  from  the 
circulation  to  infect  hepatocytes.  It  is  likely 
that  rapid  homing  to  the  liver  requires  specific 
cell-cell  interaction  between  the  sporozoite  and 
hepatocytes,  Kupffer  cells,  or  endothelial  cells 
in  the  hepatic  circulation.  It  is  thus  interesting 
to  find  that  SSP2  shares  a  sequence  with  the 
cell  adhesion  molecules,  thrombospondin, 
properdin,  and  the  terminal  complement 
components,  as  well  as  with  the  CSP  Region 
II and another Plasmodium antigen, thrombospondin
related anonymous protein
(TRAP). These thrombospondin motifs are
centered on the nonapeptide WSPCSVTCG,
and are found in 3 copies in thrombospondin,
6 copies in properdin, and one copy in all CS
proteins sequenced to date [29,30]. SSP2 may
also  have  a  role  in  cell-cell  interactions  between 
the  sporozoite  and  the  mammalian  host. 

SSP2 bears a striking similarity to TRAP which extends considerably beyond the
thrombospondin repeat motif. The first 281 amino acids of SSP2 and TRAP have a 43%
similarity at the amino acid level. Ten of 11 cysteines in the amino terminal sequence of
SSP2 are identically conserved in TRAP, the only exception being a single cysteine in the
putative hydrophobic leader of SSP2. A region of 56% identity extending over 71 amino
acids is found at the carboxy terminus. The similarity in overall structure, as well as the
striking amino acid sequence similarities at the amino and carboxy termini, strongly
suggest that TRAP is the P. falciparum analogue of SSP2. It is interesting that SSP2 has a
large repeat region, while TRAP, an apparently closely related protein, has none. If SSP2
and TRAP are indeed analogous proteins with the same function, the absence of repeats
in TRAP may call into question the functional importance of repeats in Plasmodium
antigens generally.